Minimum error discrimination problem for pure qubit states 
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The necessary and sufficient conditions for minimization of the generalized rate error for discrim- 
inating among A'^ pure qubit states are reformulated in terms of Bloch vectors representing the 
states. For the direct optimization problem an algorithmic solution to these conditions is indicated. 
A solution to the inverse optimization problem is given. General results are widely illustrated by 
particular cases of equiprobable states and A*' = 2, 3, 4 pure qubit states given with different prior 
probabilities. 
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I. INTRODUCTION 



Essential advances in experimental techniques in re- 
cent years made possible high precision measurements 
which are required for distinguishing among separate 
nonorthogonal quantum states. These studies are stim- 
ulated by needs of quantum communication and quan- 
tum cryptography where the discrimination of quantum 
states is one of the key problems. In a more general 
context, this problem underlies many of the communi- 
cation and computing schemes that have been suggested 
so far According to quantum- mechanical laws there 
is no way to discriminate perfectly among nonorthogonal 
states. Therefore, an actual problem is the state discrim- 
ination in an optimal way. Currently, different criteria of 
optimality are formulated (for a recent review, see 0]). 
In this paper we will consider discrimination strategies 
based on the minimization of the generalized rate error 
(or more generally, the mean Bayes' cost [3, 3). They 
are known in the literature as the minimum- error dis- 
crimination strategies. 

Necessary and sufficient conditions for realizing the 
minimum-error (generalized) measurement were formu- 
lated independently by Holevo [1, [1] and Yuen et al 
(see also a recent appealingly simple proof in P). Un- 
fortunately, as Hunter Q pointed out, these conditions 
do not provide a great insight into either the form of 
minimum-error measurement strategies, or into how er- 
ror probability depends on the set of possible states. 
Moreover, the only strategy derived from these condi- 
tions corresponds to two possible states jS]. All other 
optimal solutions, except the one proposed by Hunter 
Q for equiprobable qubit states, were first postulated 
and then shown to satisfy these conditions. Actually, 
Hunter's solution Q is based on solving these conditions 
only partly and consists in the formulation of a two-step 
procedure. First, he proposed to find some auxiliary op- 
erators and then to check which of these operators cor- 
respond to possible measurements. In this way, he was 
able to find a solution for equiprobable qubit states with 
a clear geometrical visualization. We have to note that 
our approach based on the full solution of the necessary 
and sufhcient conditions permitted us to draw slightly 
different conclusions compared to these published in Q. 



In the current paper, we first formulate conditions, to 
which the Lagrange operator should satisfy, when a mea- 
surement strategy remains to be optimal after a given 
set of states is enlarged by new states. Then we refor- 
mulate the known necessary and sufficient optimization 
conditions for pure qubit states given with arbitrary prior 
probabilities in terms of Bloch vectors representing the 
states. This permitted us to indicate an algorithmic solu- 
tion of the (direct) optimization problem. Furthermore, 
using the new form of the necessary and sufficient op- 
timization, conditions we succeeded to solve the inverse 
optimization problem, i.e., the problem of finding all sets 
of states and prior probabilities for which a given mea- 
surement strategy is optimal. For two-qubit states, we 
found a solution to the optimization problem for the case 
when the states are given with probabilities pi and p2 
such that pi -\- P2 < 1- For pi -\- P2 = 1, our solution 
reduces to the one first found by Helstrom 0] (see also 
[2|). Using this result, we derived a condition for TV qubit 
states pi , . . . , piq when they are optimally discriminated 
by the same orthogonal measurement which is optimal for 
discriminating among the states pi and p2 ■ This permit- 
ted us to formulate conditions for distinguishing projec- 
tive measurements from generalized ones for optimal dis- 
crimination among three qubit states. For N equiproba- 
ble states, we have shown that the optimal (generalized) 
measurement may contain M > 4 elements of a positive 
operator- valued measure (POVM) only if there exists a 
subset of M states which may form a POVM. Otherwise, 
the optimal POVM contains either three or two elements. 
We applied our algorithm to the case of three states, two 
of which are given with equal probabilities. For a par- 
ticular case of three mirror symmetric states, our result 
coincides with the one previously published by Ander- 
son et al. For another particular case, we compared 
our solution to a solution obtained numerically in [11]. In 
contrast to ^111] , our approach gives exact analytic bounds 
for the prior probability, distinguishing projective opti- 
mal measurement from the generalized one. As another 
illustration of our approach, we obtained the optimal so- 
lution for four qubit states, two of which are given with 
the probability p and two others with the probability 
1/2 -p. 
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II. MINIMUM ERROR CONDITIONS FOR 
PURE QUBIT STATES 

A. Minimum error conditions 

Assume that we are given A^(> 2) pure qubit states 
Pj-p]=p->0, Trp, = l, j^l,...,N (1) 

and a (discrete) probability distribution pj > 0, 

J2'j'=iPj = 1- We assume also that the operator pj 
occurs with the probability pj and acts on the vectors 
of a two-dimensional Hilbert space with the usually de- 
fined inner product. A (generalized, see, e.g., mea- 
surement that discriminates between the states pi,. . . ,pjv 
can be described with the help of N detection operators 
TTi,. . . jTTjv such that 

7r]=7r, >0, (2) 

N 

Y.^,=I (3) 

3 = 1 

where / is the identity operator. If as a measurement 
result the detector tt^q clicks we know with the probabil- 
ity Tr(/9if,7ri,-|) that the chosen state is pi,,. The overall 
probability of correctly identifying any of the states pj is 
then given by (see, e.g., 0) 

N 

Pcorr = '^Pi Tr(7riPi) = Tr T (4) 

i=l 

where we introduced 

N 

i=i 

and Perr = 1 — Pcorr is the overall probability of an erro- 
neous guess. The minimum error discrimination strategy 
corresponds to such a set of detection operators nj that 
Perr takcs its minimal value or equivalently Pcorr takes 
its maximal value. Operator F ([5]) is known in the lit- 
erature as the Lagrange operator (see e.g. Q). Since its 
trace defines Pcorr we will call its matrix representation 
in a basis the cost matrix. 

According to d, 0] j the necessary and sufficient condi- 
tions for a measurement to be optimal are 

r = rt , (6) 

G,:=r - p,p,>0, j = l,...,N. (7) 

As it was proved by Holevo [1], instead of F, one can 
equivalently use F = (F + Ft)/2 and at the extrcmum 
point F = F. Below, we will use also the following impli- 
cation of Eq. O 

G,7rj=0, j = l,...,iV. (8) 



It is not difficult to see that from Eq. ([8]), it follows that 
for qubit states, if both itj ^ and Gj ^ 0, then they 
both have a zero eigenvalue and therefore they both are 
proportional to projectors (cf. Indeed, since tTj is 

Hermitian one can choose a coordinate system where it 
is diagonal with at least one non-zero diagonal entry. In 
this system from ([S]), it follows that at least one column 
of Gj and one diagonal element of tTj should be equal 
to zero, i.e., both detfrj = and detGj = and both 
these operators are proportional to projectors. In what 
follows, we will extensively use this property. 

We would like to note here that for an optimal strategy, 
some of the operators tTj may become equal to zero. This 
happens if some of the detectors tTj , say for definiteness 
the detectors with the numbers j = Af + 1, . . . , iV, never 
click. In this case, only the states pj, with j = 1, . . . , M, 
enter the cost matrix F ^ and their probabilities satisfy 
the condition Yl!jLi Pj < 1 while the first M POVM el- 
ements, TTj, j — 1, . . . , M, satisfy conditions ^ and ^ 
with N = M . We would like to stress that this property 
may be formulated in a more constructive way. 

Proposition 1. Assume that for M states pj, j = 
l,...,Af, given with the probabilities pj such that 

'l2^LiPj < 1; detection operators TTj, satisfying condi- 
tions (0j and (0) at N — M , which maximizes the success 
probability Pcorr |^ for the cost matrix F at N = M , 
are known. Then for a larger set of N > M states pj, 
j — 1, . . . , N given with the probabilities pj, X^jLi Pj — 1; 
such that Pj = Pj andpj = pj , j — 1, . . . , M , the optimal 
POVM contains all previous elements itj, j — 1, . . . , M 
plus the zero elements TTj = 0, j = M +1, . . . , N provided 

r-p,p,>0, j=M+l,...,N. (9) 

The optimal success probability for the new s 
p 

^ corr • 

As the final comment of this section, we note that in 
what follows, we will distinguish the projective (i.e., von 
Neumann) measurement from the generalized one. The 
projective measurement may be realized only if there are 
no more than two detection operators in which case they 
should satisfy the usual condition n'^ = TTj. For the gen- 
eralized measurement one needs more than two detection 
operators. 



B. Minimum error conditions in terms of Bloch 
vectors 

We find convenient to choose Pauli matrices cr^ {i — 
1, 2, 3) and cto = / as a basis in the linear space of Her- 
mitian 2x2 matrices 

2 T 

ajO-k = —a-kO-j = lai , cr^ = do = , 
TrfTj=0, Tr(fT,(7fc) = 2(5,-fc, j, fc, Z = 1, 2, 3 . (10) 
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Thus, once N states pj are given, the coefScients (3j^k 
defining the states 



with the vectors 7^ such that '^i7i ~ 0- 

Now we can prove the following statement. 



3 

k=0 



(11) 



may be found as Pj^k — 5Tr(pj(Tfc) G M. Normalization 
condition ([T]) imposes the following restrictions on (3j^k- 



/3. 



'3,0 



k=l 



(12) 



Therefore, any state is defined by a point on a sphere with 
the radius equal to 1 /2 or equivalently by a 3-dimensional 

real vector (Bloch vector) (ij = {(3j.i, (3j.2, Pj.s) of the 
length equal to 1/2. 

As it was argued in the previous section we may look 
for the detection operators in the form 



% = ^3^j : = T^3 = 7r| > , UJj > . 



(13) 



We shall call tOj the frequencies. Taking into account 
Proposition [TJ we may look for M operators ttj optimiz- 
ing the measurement strategy for M < N states pj, j = 
1, . . . , M, given with the probabilities pj, Pj — 1- 

The problem is solved if such a strategy is found and the 
remaining states (if M < N) satisfy conditions Q . Note 
that the optimization problem has always a solution (see 

my 

Operators TTj will be defined in terms of unknown co- 
efficients 7j,fc, 



3 

^"^Ij.kCTk 

k=0 



which should satisfy the relations similar to Eq. H 



7j,o 



3 

Y.^3,k 
k=l 



(14) 



(15) 



and every pj is also defined by a 3-dimensional real vector 
7i = (7j,i!7i,2,7j,3) of the length equal to 1/2. Moreover, 
from Eq. Q one finds additional restrictions 



0, fc = l,2,3, 



M 

M 

^ = 2 , ujj>0. 

3 = i 



(16) 



(17) 



According to Eq. ([5|) the cost matrix is also defined in 
terms of the vectors l3j and , 



3 

E 

k,k' = l 
M 



^k,k'CFk<Jk' 



Tfc^fc' = ^^3P3P3:kl3,> 
3 = 1 



(18) 
(19) 



Proposition 2. Ifojj > 0, j — 1, . . . , M, then conditions 
^ are satisfied for frj — ujjTTj, where tTj are given by Eq. 
14^ with 

l3,k^ ^'-'f'~^\ fc = l,2,3, j = l,...,M, (20) 

A - Pj 

where 

A^TvT>p,, Vj (21) 
and Bk G K are arbitrary parameters. 
Proof. According to Eqs. pB]) and pU)) . one may write 

33 3 



r = ^ rfe^fco-Q + ^(rfc,o + ro,fc)f''fc + E rfc.fc'^fe^fe' > 

k=/^k' = l 

(22) 



fc=0 



fc=i 



where [see Eqs. (HH), (HH) and JH])] 



1 

T/CO = -^^^3Pjp3-k: 



To.* 



1 

;j E '^0P3l3,k ■ 



(23) 



(24) 



As it was mentioned in Sec. Ill Al instead of F, one can 
equivalently use F = (F + r^)/2 which in our case is 
given by the first two terms at the right-hand side of Eq. 
(P^ . Therefore, below, to simplify notations, we will 
mean by F the same expression p2|) where the last term 
is absent. This is justified by the property which we will 
show below. Namely, we will show that from Eq. ([5]) 
follows the Hermitian character of F and hence the zero 
contribution to F from the last term in Eq. (P^ . This 
leads to the following expression for Gf. ([7]) 



\fc=0 / k=l 



+ To,k-pj(3j^k)a-k . 

(25) 

Using Eqs. and ((25|l and assumed property ujj ^ 0, 
i.e., GjTTj — 0, we obtain a set of equations for coefficients 

7i,fc, 

3^3 
^^k,k-^P3+2^{Tkfi + To.k-PjPj,khj.k = 0, (26) 



k=0 



fe=l 



Tfc.o + ^o,k~Pj{Pj,k + l3,k) + '2jj,k ^ ^k',k' = , (27) 



fe'=0 



(Tkfi + ^o,k - PjPj,k)ij.k' = (rfe'.o + ro,fc' - PjPj.k')ij.k ■ 

(28) 
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If we multiply Eq. (|26p by 7^^^ and subtract the result 
from Eq. we get 

(rfc,o + ro,fc - PjPj,khj,k' 

3 

= 4 ^ (rfc",o + ro,fc" - PjPjM")i3,k"ijMij,k' ■ (29) 

fc"=0 

The right- (and hence the left-) hand side of Eq. is 
symmetric with respect to the permutation of k and k' 
and therefore Eq. ([28)) is an implication of Eqs. ([26]) and 
(P7)) . From the other hand, multiplying Eq. (|?7|) by ^ 
and summing up over fc gives just Eq. (|26|1 . From Eq. 
(I27p follows the symmetry property Tk,k' = Efc/ fc and, 
hence, the Hermitian character of the matrix F. Indeed, 
multiplying Eq. ([27]) by tojjj^k', summing up over j, and 
using Eqs. ^ and ^ yields 

M 

^k,k' = - '^P]^]l],klj,k' =rfe/_fc, fc,A:' = l,2,3. 
i=i 

(30) 

Thus from the set of Eqs. (P5)) - (|28p . we have to solve 
Eq. ([27l) only. For this purpose, we denote 

Xj^k := ^k.o + ro,fc - Pj{P],k + lj,k) ■ (31) 

These quantities have two remarkable properties. The 
first property 

M 

Y,^]Xj^k^O, fc = l,2,3 (32) 

follows from Eqs. (dH), dM]), and ^ and the second 
property 

M 3 ^1 
Y,Y.^no.kX,,k = -Y.Tk,k - - 2 (33) 

j=l k=l k=0 

is a consequence of (fTSj) . (fT6|) . (fTS]). and (fT9|) . Using Eqs. 
(PT|1 and dSSl), we rewrite Eq. as 

A/ 3 

Xj.k = 27j- fc ^ ^ Wjv7j-,fc,Xjvfc, , (34) 
j'=ifc'=i 

which has a solution 

^j,fc = A7,, , (35) 

where A is for the moment an arbitrary constant. Putting 
thus found Xj^k to Eq. ^ and using Eqs. dTS]) and (fTT)) . 
we relate this constant with Pcom ^ = TrF = Pcorr- 

To finish the proof of the statement, we solve Eq. (pij) 
with respect to 7j.fc. For this purpose, we replace in Eq. 
(f3T|) Ffc^o + To^fc by the sum of Eqs. ^ and ^ which 
yields 

1 ^ 

i'=i 



Then, using Eqs. (|16p and ([TT)) . we find a solution to the 
above equation 

PjiPj.k+lj.k) = -Xj,k+Bk , J = 1, . . . , M , A: = 1, 2, 3 . 

(36) 

Here, parameters Bk remain arbitrary. From here and 
Eq. HMD, we get Eq. 1^. 

Finally, we note once again that since all luj are as- 
sumed to be different from zero, every matrix Gj has a 
zero eigenvalue (cf. [^). Therefore, the condition Gj > 
reduces to Tr Gj > which upon using Eq. P5|) yields 
A>pj. □ 

Note that the proposition[2]leaves parameter A and the 
frequencies ujj unspecified. Nevertheless, if we assume 
the frequencies to be invariant under a scaling transfor- 
mation of the probabilities, i.e., after the replacement 
Pj ~* ctpj, one has ujj —^ ujj, then from Eq. ()19p . it 
follows that A becomes a homogeneous function of the 
first degree with respect to the probabilities pj provided 
the parameters jj^k are also invariant with respect to the 
same scaling. The last property agrees with Eq. ([20]) 
from which the scale invariance of the parameters ^j^k 
follows if A is a homogeneous function of the first degree 
with respect to Pj. Note also that under these assump- 
tions, the detection operators TTj become invariant with 
respect to the same scaling transformation. 

C. Direct optimization problem 

By the direct optimization problem, we mean the prob- 
lem of finding the operators tTj minimizing 

Perr for a 

given set of operators pj. Any set of itj satisfying Eqs. 

dSl), and ^ with F given in Eq. jSl) is a solution to 
this problem. Proposition [5] opens a way for an algorith- 
mic solution to the problem. 

First, we note that the operators nj = ujjiTj are de- 
fined by the frequencies ujj and Bloch vectors jj = 
(7j,i, 7j,2, 7^,3). According to Proposition [21 the vec- 
tors 7j are defined by the parameter A and vector B = 
{Bi, B2, B3). The still undefined parameters A, ojj, and 
B should be found form Eqs. HH), HH), and (HT]). Using 
Eq. ([^ . we obtain from Eq. ([T5]) a set of equations for 
Bk and A, 

11^ 3 

-A^ - -Ap, E ^'^'3^'^ , J = 1, . . . , M . 

k=l k=l 

(37) 

It is convenient to rewrite this system as M — 1 homo- 
geneous equations for A and B^ and an equation of the 
second order with respect to A and Bk ■ For this purpose, 
we subtract from the Eq. (j37p at j = 1 the same equation 
at j = 2, . . . , M, thus obtaining 

3 

4 Bk{pif}i,k - Pjf3j,k) = A{pi -p,), j = 2,...,M 

k=l 

(38) 
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and sum up the Eqs. 
yields 



37|) over j from 1 to M which 



M 



ujj and parameters 7j,fc, we have to find pj and /3j,fe, j = 
1, . . . ,N, k — 1,2,3. We formulate the solution to this 
problem as the following proposition. 



MA^ - 2Aa = 4M ^ - 8 ^ ^ BkPjPj^k , (39) Proposition 3. Let the Block vectors j ^ I, 



k=l 



j = l k=l 



where a = X^jliW- We thus have M equations for four 
unknown parameters A and -61,2,3- Therefore, it is nat- 
ural to expect that the system may have solutions for 
M = 2, 3, 4 and has no solutions for M > 4 except for 
some special cases. This means that except for some par- 
ticular cases, the optimal measurement for iV > 4 pure 
qubit states is realized with four operators tTj or less. 
Thus, if for M = 4 the main determinant of the system 
(l38l) is different from zero, we can find all Bi as linear 



functions of A. The parameter A 
from the second-order Eq. ((39)) . After that, we find 7^,^ 
from Eq. (|20p and have to check whether the systems 
(Uni) and P?|) are compatible. If this is the case, from 
this system one can find the frequencies Uj . If all Uj are 
nonnegative, coj > 0, the problem is solved. Otherwise, 
one has to choose M = 3. We have to note here that 
although in principle the solution of the linear system 
of equations p6| . (|17p may always be found analytically 
(for instance, as a ratio of corresponding determinants), 
its explicit expression may be rather involved (see some 
simple examples below) and difficult for analysis. In such 
cases, numerical solution to this system may be helpful. 

For M = 3, if we add to two equations ([55)1 the com- 
patibility condition of the system (fTB)) considered as a 
homogeneous system with respect to cuj , we obtain an in- 
homogeneous linear system of equations for -61,2,3. This 
system defines Bj as linear functions of A and similar to 
the previous case, Eq. (|55)) becomes a second-order equa- 
tion with respect to A. After that, we can omit one of the 
equations ()16p and from two remaining equations and Eq. 



(fT7)) find LUj. If all ujj > 0, the problem is solved. Oth- 
erwise, we have to choose M — 2. In this case, the mea- 
surement becomes projective (i.e., von Neumann mea- 
surement). Since an optimal measurement exists always 
(see e.g., Q) in this case among N given states, there 
always exist two states (say pi and P2) such that if we 
assign them the probabilities ^Pi,2, P1+P2 = ct, they are 
optimally discriminated with an orthogonal measurement 
and all the other states satisfy the conditions formulated 
in the proposition [Tj This property follows from the in- 
variance of the detection operators with respect to scaling 
transformation pj —^ apj mentioned in Sec. Ill BI Below 
for N = 2 (Sec. IIII A|) this invariance will be demon- 
strated explicitly. 



and numbers loj > such that X]j=i ^jlj 



.N 
and 



N 

j=l ^3 



— 2 defining a POVM -kj — ujjTTj be given 



Then this POVM corresponds to an optimal measurement 
strategy for the states pj given with the probabilities 



by the Block vectors 



whe 



I + (27j- + R)-R 



N 



A=l/F, F = Y, 



1i 



(40) 



(41) 



(42) 



(43) 



where R is an arbitrary real vector and by dot we de- 
note the scalar product of tke vectors from a three- 
dimensional Euclidian space. The optimal success prob- 
ability is Pcorr = A. 

Proof. Some necessary conditions for an optimal mea- 
surement are formulated in Proposition [5] Thus, assum- 
ing the vectors Tj be given in a frame by their coordinates 
7j,/c and using Eq. (|^D)) . we find the coordinates (3j,k of 
the vectors 



{A~p.i)-f,,k + Bk 
Pi 



(44) 



where B^ are arbitrary real numbers which define a vec- 
tor B — (-Bfc). For to be coordinates of Bloch vectors, 
they should satisfy Eq. (fT2)) which may be solved with 
respect to pj , 



Pj = 



+ ELi Bl + 2A Ij^Bk ^ ^^j.^ 



If now we put B ~ AR, i.e., Bk = ARk, we obtain from 
here just Eqs. (|M)) and ([^ . At the same time, Eq. 
(HD) reduces to Eq. (|3T)). Equation ([33]) follows from the 
condition X^^iPj — 1- ^ 



D. Inverse optimization problem 



III. PARTICULAR CASES 



By inverse optimization problem, we mean a problem 
of finding all possible states pj and the prior probabilities 
Pj for which a given measurement strategy is optimal. 
For qubit states, this means that given the frequencies 



N=2 



Although the case of two qubit states is studied in 
details ^^'^ instructive to show how the known 
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solution follows from our approach. Moreover, in view of 
Proposition [1] we need to consider the case where pi + 
P2 — OL < \ which was not studied so far. 

We find convenient to choose a frame in the three- 
dimensional Euclidian space such that the given states pi 
and p2 are defined by the Bloch vectors [3i and /?2 with 
coordinates /3i — (/3i,i, /9i,2, 0) and /?2 = (/3i,i, — /3i,2, 0). 

From Eq. dH]) at fc = 3, we learn that = 0. After 
solving the compatibility condition of two remaining Eqs. 
p6| (with k = l and k = 2) together with Eq. ([38]), we 
find Bi and B2 as linear functions of 



Bi 

B2 
where 



^[A{p,^P2f+8Pl2PlP2iPl+P2)] (46) 



A{pi+p2)-8Pl,pip2], (47) 



^ - -P2)' +/3L(Pi +P2)'] 



Placing thus found Bi and B2 to Eq. 
solve this equation with respect to A, 



,211/2 



(48) 



37|) at j = 1, we 



A^liP. 



P2) 



D. 



(49) 



Here from two roots of Eq. (l37|) . we kept the biggest 
one. With the values A and B12 given in Eq. ([49]) and 
Eqs. and (|T7)) . one can transform the left-hand side 
of Eq. p6|) at A: = 1 to a form (0-12 — wi) times a nonzero 
factor so that one finds from the system (jl6p and (fT7|) 
that (jji = 0^2 = 1. 

Once the coordinates i?i,2,3 of the vector B and the 
optimal success probability Pcorr — A (|49)) are found, we 
may calculate detection operators 



7r2 = / - TTl , 



(50) 



1 



2D 

and the cost matrix 



[PiApi - P2) - iPiaiPi + P2)] (51) 



1/2^ 7* 
4 I 7 



7 = -^{«/3i,2(Pi -P2)(2D +pi +P2) 
+/3i,i[(pi - P2f + "^Dipi + P2)]} . 



(52) 



(53) 



Here we observe the invariance of the detection operators 
(|50p with respect to scaling of the probabilities pj — *■ apj 
mentioned above. 

If we notice that the parameters /3i,i and /3i_2 may be 
expressed in terms of the Bloch vectors (3i and (32 , 



Pi 



{I32+I3l)-P2 
11/32+ All 



Pi 



W2-Pl)-P2 

11/32 -/3i II 



(54) 



where by ||/3|| the length of the vector /3 is denoted, we 
may present parameter D and with it the optimal success 
probability A (|49|) in an explicitly covariant form 



D = 



pI+pI 



-,1/2 



- 2P1P2P1 ■ P2 



(55) 



Another covariant form for A corresponds to using the 
absolute value of the overlap between the state vectors 

('0l|'02) = Si,2, Pi = |'0l)('0l|, P2 = |V'2)(V'2|, 



1 



|si.2r=Tr(pip2) = 7r + 2/3i-/32 



which yields 



A 



Pi +P2 



[iPi 



P2) 



- 4piP2|si,2p] 



1/2 



(56) 



(57) 



For pi + P2 — 1, the last equation gives the value known 
as the Helstrom bound (see also 0]). 

Once the problem for two states given with the prob- 
abilities pi and P2, Pi + P2 < 1, is solved, we can apply 
Proposition [1] for formulating conditions for N > 2 qubit 
states which are optimally discriminated by the same or- 
thogonal measurement which optimally distinguishes the 
given states pi and p2- For that, assuming that the states 
Pj are given with the probabilities pj , we have to calcu- 
late both the trace and the determinant of the matrix 
Gj — r ~ PjPj- After some algebra, one finds 



TtGj=A-p,, 



detG, = -{A-p,) ^ 



(58) 



PjjPi -P2) 
2D 



{PlPl - P2P2) ■ Pj 



+ P, [pi{Pl - Pj)+P2{P2 - Pj)\ ■ Pj ■ (59) 

If both Tr Gj > and det > 0, j 3, . . . , N, then 
the optimal measurement is the projective measurement 
which optimally distinguishes pi from p2 given with the 
probabilities pi and P2, Pi +P2 < 1- 

For N equiprobable states, given with the probability 
p = 1/N, we have pj — p and ^ = p-|-D so that Eq. (IST 
reduces to 



detG,=p^Pi+P2)-{Pj-Pi) 



(60) 



B. iV = 3 

For three given states pi, p2, and pa, it may happen 
that the optimal measurement is the projective measure- 
ment. In such a case, because of the aforementioned scale 
invariance of the detection operators for TV = 2, there are 
three possibilities for the optimal discrimination among 
the states pi, p2, and p^. The optimal measurement may 
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be the projective measurement which optimally discrimi- 
nates among any two from three given states. Therefore, 
to distinguish these possibilities below, we will need to 
put additional labels to quantities A and Gj. In cases 
where there is no confusion, we will use the previous no- 
tations also. Thus, A defined by Eqs. (j49]) and (l55|) will 
be denoted as Ai,2 so that 



A 



1,2 •- 



:(P1 +P2) + 



pI + pI 



-,1/2 



2piP2/3i ■ Pi 



(61) 



and det Gj given in Eq. (|59p at j = 3 will be denoted 
detGi_2;3, i-e., detG'i,2;3 := detGa. For the set of labels 
{1, 2, 3}, we will use also cyclic permutations {1, 2, 3} 
{2, 3, 1} — > {3, 1,2}. Using these notations, we may apply 
the proposition [1] to formulate conditions distinguishing 
projective optimal measurements from generalized ones 
for discriminating among three qubit states. 

Proposition 4. Let the different states pi, p2, and 
he given with the probabilities pi, p2, P3 and (3i, (32, (3^ 
as their Bloch vectors. The optimal measurement dis- 
criminating among these states is the projective measure- 
ment if the inequalities Ai_j > pk and Gij-k ^ hold for 
one cyclic permutation of the labels {i,j,k} = {1,2,3} 
at least. Otherwise, i.e., if no cyclic permutation exists 
such that both above inequalities hold, the optimal mea- 
surement is generalized. In the first case, the optimal 
success probability is Pcorr = Aij . In the 



secona case. 



Pcorr — A where A, should be found from the system of 
equations |j'7|), ilO^) . and [F, 



Proof. First we note that since the states are assumed to 
be different from each other, the optimal POVM contains 
more than one element. The statement becomes evident 
if one notices that the optimal solution exists always and 
the optimal POVM contains three elements if it cannot 
contain two elements. □ 

Note nevertheless that for the general case of three 
qubit states, solution to the system of equations (|37p . 
(fT6|) . and (flT|) is rather involved. Therefore, below we 
will consider some particular cases. 



Equiprobable states 



Let us consider pi = P2 = Ps 
this case, A 



p = l/N (N = 3). In 



P 1 



2/3i ■ P2 ] > P and whether 



the optimal measurement is projective or generalized is 
defined by the sign of det Gi^2;3- Thus, using Eq. ([60| at 
j — S, we may formulate a modification of Proposition [4] 
for this particular case. 

Proposition 5. For three different equiprobable states pj 
given by their Bloch vectors Pj, j = 1,2,3, the optimal 



POVM includes three elements if the following inequali- 
ties. 



(/3i+/32)-(/?3-/3i)<0, 
03 + Pi) ■ 02 ' Ps) < , 
02 + Ps) ■ 01 - P2) < 



(62) 
(63) 
(64) 



hold. Otherwise, i.e., if at least one of these inequalities 
is violated the optimal measurement is projective. 

If for instance (/3i -1-/32) • {P3 — Pi) > then the optimal 
measurement is the one which optimally distinguishes the 
state pi from p2. In the coordinate system where 



Pi = iPl,uPl,2,Pl,3), 
P2 = (/3l,l,-/3l,2,/3l,3), 
03 = [P3,l,P3,2,Pl,3), 

/3l,l>0, /3l.2>0, Pi.3>0 

the optimal success probability reads 

Pcorr = Ai^2 ~ '^^^ ~^ '^Pl-2) ■ 



(65) 
(66) 
(67) 

(68) 



(69) 



For the generalized measurement, the system of Eqs. 
(f37l) . p6|l. and (fT7|) in the frame ([65|) -([67 | acquires a sim- 
ple form and one easily obtains that the vector B points 
to the positive direction of the z axis, B = (0,0, ^Pi^s), 
and the optimal success probability depends on the com- 
mon projection of the vectors onto the same axis 



Pcorr ^ A=p{l+ Jl^APl^) , p=-. (70) 



(71) 



For the frequencies Uj, one gets 

^ TPi,iP3,2 - Pl,2P3,l ^ Wl,l 

PiAPl.i- P^,i) ' P1.1-P3.1 



From the proposition [Sj it follows that inequalities 
([M)) imply positivity of the frequencies ([7T|) . 

From Eq. (|20p. we find the vectors 7^, j ~ 
1,2,3, defining the detection operators Tij, 7j = (1 — 
4/3i,3)~"^^^(/3j,i, /3j,2, 0). We thus see that these vectors 
lay in the z = Q plane and differ from the projection 
of the vectors Pj onto the same plane by the factor 
(1 — 4/3f g)^^/'^. In particular, if the vectors Pi, P2, and 

P3 are coplanar, i.e., if /3i_3 — 0, the measurement is gen- 
eralized provided these vectors themselves may form a 
POVM. 

It is also useful to represent the success probability 
(17011 in a covariant form 



Pcorr^pl 1+ V1-4/3-/3 



1 

^=3 



(72) 
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^ |l/?lX/?2 + /?2X/?3 + /93X/3i||2 ^ > 

where by oblique cross, we denote the vector product 
and by brackets the mixed product of three-dimensional 
vectors. 

Once the operators tt^ and frequencies ujj are de- 
termined we may calculate the cost matrix T. Thus 
from Eq. ([5]), it follows that this matrix is diagonal 
r = diag(A/2-K/3i^3,^/2-/?i,3). 

Now, assuming that the states pi, p2, and are 
discriminated by a generalized measurement and using 
Proposition [1] we can formulate conditions when the 
optimal measurement for discriminating among > 4 
equiprobable states given by their Bloch vectors Pj — 
{f3j,i, f3j,2, Pj.s), j — l,---,^, is the same which is op- 
timal for the first three states. This is the case if both 
the trace and determinant of the matrices Gj = F — ypj , 
j = 4, . . . , A^, are non-negative. Since the trace is always 
positive, TrGj = p-y^l — 4/?i^3 > 0, it remains to analyze 
the sign of the determinant detG^- = /3i,3(/3j,3 — (ii.'i)- 
Since /9i 3 is assumed to be positive, det Gj > if 
/3j,3 — /3i_3 > 0. From here, it follows that if all the 
additional vectors in the chosen coordinate system share 
the same hemisphere of the Bloch sphere and their lati- 
tude is grater than or equal to the latitude of the three 
given states, then they are discriminated by the same 
measurement as the three given states and the optimal 
success probability is given by Eq. ([70|l [or Eq. ([72|l ] with 
■p ~ \/N . If the vectors are given in another coordinate 
system, the above condition takes the form 

/3-(/3j-/3)>0, J =4,..., TV (74) 

with f3 given in Eq. ([75)1 . 

The optimization problem for three equiprobable 
states plays a fundamental role since for TV > 4 equiprob- 
able states, the following statement takes place: 

Proposition 6. Let N > A different equiprobable states 
Pj be given by their Bloch vectors Pj , j = 1, . . . , N . Then, 
(i) if no vector B =/= exists such that 

B-P,=B-p,, j^2,...,N (75) 

and the vectors [3j, j — 1, . . . , N cannot form a POVM, 
i.e., no numbers LUj > 0, j — 1,...,N .such that 

'^J^i ojj = 2 and X^^Li '^jPj — exist, then no solu- 
tion to the optimization problem, containing N POVM 
elements, exists; 

(a) if under the same assumption for the vector B the 
vectors Pj can form a POVM, then — Pj, j = 
1, . . . , TV, and Pcorr = A = 2p= 2/N; 
(Hi) if a vector B ^ satisfying the conditions |75[ ) ex- 
ists then the optimal measurement may always be realized 
with either two or three POVM elements. 

Proof. The statement (i) becomes evident in view of Eq. 
((20)1 which tells us that for B = Q, a solution is possible 
only if 7j = Pj, in which case iTj — pj. 



The statement (ii) is a particular case of a solution 
previously indicated by Yean et al. 0] ■ 

Thus it remains to prove the statement (iii). For that, 
we will use the induction method and first prove the 
statement for TV = 4. If for = 4 there exists a vector 
B ^ satisfying Eq. ([75]) . then the system ([55]) has a 
nontrivial solution and we easily find that in the coordi- 
nate system ([55 |) - (p7)) . the vector B points to the positive 
direction of the z axis. Using Eqs. (|20p . ([TT]) and (|16p at 
k = 3, we find its exact value, B ~ (0, 0,p/3i_3), p = 1/4. 

To find the frequencies LUj, j — 1, ... ,4, we have to 
solve two from three equations given in Eq. (I16p (at 
k = 1,2) together with Eq. ([T7]). From these equations 
one can always express say wi, lu2, and ^3 in terms of 
UJ4 but only solutions with all 0;^ > 0, A; = 1, . . . , 4 are 
suitable for our purpose. It is important to stress that 
'^1,2,3 are linear (and hence monotonous) functions of 1^4. 
If we put U4 = 0, we find unique values for wi, W2, and 
UJ3 since corresponding linear inhomogeneous system has 
a unique solution. If additionally all these frequencies 
are nonnegative, ujj > 0, j = 1,2,3, then the optimal 
measurement contains three POVM elements tti, 7r2, and 
7r3 and the statement is proven. Let at ti>4 = at least one 
of ujj, j = 1, 2, 3 is negative, say, for definiteness cji < 
and [^2 > 0, W3 > 0. If the optimal measurement with 
four POVM elements exists, then taking into account a 
monotonous dependence of wi.2,3 on W4, we conclude that 
there exists a value W4 = 0-140 > such that wi = and 
C1J2 > 0, 0^3 > and the optimal measurement contains 
no more than three POVM elements 7r4, tts, and ^2 which 
also tells us that the statement is proven. If the optimal 
measurement with four POVM elements does not exist, 
then it contains cither three or two POVM elements since 
by assumptions that the given states are different from 
each other it cannot contain one element. Thus for N — 
4, the statement is proven. 

Assume the statement be correct for N Bloch vectors 
(the main assumption) and prove it for TV -f 1 vectors. 
For the general case in the coordinate system (|65l) - ([57)) 
from Eqs. and ([HI), we find the same 

value for the vector B as it was for TV = 4 with the 
sole difference that now p = 1/(TV -1-1). To find TV -I- 1 
frequencies LUj, we have to solve the same Eqs. (jl6l) at 
k = 1,2 and Eq. I|17p from which we can always express 
three of them, say, tui, 0^2, and tu^ as linear (and hence 
monotonous once again) functions of ujj, j = 4, . . . , TV -|- 
1. Let us put ujj — 0, j = 4, . . . , N + 1. If we find 
> 0, j = 1,2,3, the statement is proven. Therefore, 
assume that at least one of these frequencies is negative, 
say, for definiteness wi < and W2 > 0, W3 > 0. The 
situation here is a bit more complicated than it was for 
TV = 4 since the dependence of every ujj, j — 1, 2, 3 on the 
other frequencies may be increasing with respect to some 
frequencies and decreasing for some others. If there exists 
a measurement with TV -I- 1 POVM elements, different 
scenarios may take place but in all cases, there should 
exist a set of nonnegative frequencies. In particular, it 
may happen that there exists such values of = u>jo > 0, 
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j = 4, . . . , N+1 that wi = and (M2 > 0, > 0. Another 
possibihty may be for instance such that at uji = 0, at 
least one of two other frequencies becomes negative, say, 
LU2 < 0, but with a variation of the frequencies coj, j — 
4, . . . , A^ + l the frequency L02 should become positive and 
hence should cross the point L02 = 0. In other words, the 
set of the values which a;i^2.3 take under the variation 
of S [0, 2], j = 4, . . . , iV + 1 necessarily contains a 
point where at least one of wi, 0^2, and UJ3 vanishes. This 
means that the optimal measurement should contain no 
more than N POVM elements. For wi = and UJj > 0, 
j — 2, . . . , + these are tt2, ■ ■ ■ , ttjv+i- Therefore, the 



matrix F = p ^ 



JV+l 

i=2 



jTTjPj satisfy conditions ^ and ([7]) 



with Pj = p ^ + !)• Since p is simply a scaling 

factor both for F and in Eq. ([7]) , these conditions remain 
valid after the replacement N + 1 N. This means 
that the same measurement is optimal for N states p2, 
. . . , pn+1 and according to the main assumption, there 
exists an optimal measurement which contains either two 
or three POVM elements. □ 

From this proposition we can extract the following 
corollary: 

Corollary 1. For N > A equiprobable states pj given 
by their Bloch vectors Pj , the optimal measurement may 
contain M > 4 POVM elements only if there exists a 
subset of M states which form a PO VM. 

Proof. Indeed, if for N states only B = satisfies Eqs. 
([75)1 and the states pj cannot form a POVM, then one 
has to examine all N subsets of — 1 states and check 
whether there exists a subset which may form a POVM. 
If the answer is positive, for instance the vectors pj, 
j — 1, . . . , N — 1, form a POVM, i.e., there exist num- 
bers ujj > such that X^jLi^ ^jPj = I 01 equivaientiy 

such that J2j=i — 2 and '^jPj — 0) then as a 

POVM for iV states, we may choose just these frequen- 
cies together with the states iVj = pj, j = 1, . . . , N and 
put ujn = 0. For this POVM, the cost matrix ^ reads 
F = pYl!j=i^]P] = PH^=i^jP] = Pl- Therefore, for 
any state p given by the Bloch vector /3 = /32, /^a) 
for the trace and determinant of the matrix G = F — pp, 
one gets TrG = p, det G = ^^(1/4 ~ Pi - Pi ~ /3|) = 0. 
This result means that the chosen POVM is optimal for 
discriminating among all N states pj. 

If the answer is negative, i.e., if such a subset of iV — 1 
states does not exist, one has to examine all subsets of 

— 2 states, etc. From here, it is clear that if there exists 
a subset of M < A^ states which may form a POVM, then 
the optimal POVM for discriminating among A" states 
consists of the POVM thus found supplemented by other 
states (for M < N) with zero frequencies. If for a subset 
of the states there exists a vector B ^ as a solution 
to the Eqs. ([75]) . then according to the point (iii) of the 
proposition [51 the optimal measurement cannot contain 
more than three POVM elements. □ 



2. Two from three states are equally likely 

Consider the case when pi = P2 = P and p3 = 1 — 2p, 
< p < 1/2. Proposition m permits us to indicate the 
cases when three states given with the above probabilities 
are discriminated by the same projective measurement as 
two states given with equal probabilities. It is convenient 
to choose a frame such that 

Pi = (/3i,i,0,/3i,3), /32 = (-/3i,i,0,/3i,3), 



Pi,i > 0, /?i,3 = ^1/4 - > ■ 



From Eq. we find Ai,2 = p(l + 2Pis) > I - 2p 

or equivaientiy p > {3 + 2/3i^i)^^. Another condition 
follows from nonnegativity of det Gi.2;3 given in Eq. (j59[) 
at j = 3 

detGi,2;3 = pPui{3p + 2pPi,i - 1) 

-H2p(2p-l)(l/4-/?i,3/33,3) >0. 

Since both Pi,i and /3i.3 are assumed to be positive and 
hence l + 3/9i,i + 2/?j^ -4/3i,3/33,3 > from here, it follows 
that 



P 



> 



1 + 2/3i,i - 4/?i,3/?3,3 



> 



1 



2(1 + 3/31, 1 + 2pl, - 4/31,3/33,3) ' 3 + 2/3i,i 

In particular at Ps^^ = 1/2, /3i,i = isin(2f?), and /3i,3 — 
i cos(26'), < 9 < tt/2, the last inequality takes the form 

2 1 



P> 



> 



5 + cos(26l) + sin(26l) 3 + sin(26l) 



and one recognizes the condition found in [10] for distin- 
guishing the optimal projective measurement from the 
generalized one for discriminating among three mirror 
symmetric states. 

Another simplification occurs at /?3_3 — 0, 

det Gi,2;3 = |(1 + 2/3i,i)(2p + 2p/3i,i - 1) , 
which leads to 



P> 



1 



> 



1 



2 + 2/3i4 2 + sin(26') 3 + sin(26l) 



Note that this case was previously studied numerically 
in [11] and we find instructive to compare our analytic 
solution with the numerical one. For this purpose, we 
additionally put P3^2 = 0, /33,i = |, and /3i,i = —P2,i — 
|j /3i,3 = p2,3 = ^-y/l — 6^, b> 0. Solving simultaneously 
Eqs. ([55)1 and the compatibility condition for the system 
(fTB)) . we find the vector 



B 



[ A{3p-1) 

\2pVT^ 



,0,0 



which being put to Eq. ([39)l with a = 1 gives a second- 
order equation with respect to A. This equation has the 
roots A = and 

2p^(l-2p)(6^-l) 
l-6p + p'^{8 + b'^) ■ 
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With these values of Bj and A we solve the system of 
equations ^ 



UJl.2 = ± 



l+p[4(p-l) + 62(5p-2)] 



6[1 + 39(-6 + (8 + 62)p)]2 
X [-1 + p{i - 4p + b[T2 + {±6 + b)p])] , (76) 
W3 = 2 - LJi - UJ2 ■ (77) 

From here, it follows that the equation oja = has only 
one real root inside the interval < p < 1/2, which 
is p = p,. = (2 + 6)"^ = (2 + 2/3i,i)"i and if p > Pr, 
C1J3 < 0. The equation lu2 = has no real roots inside 
this interval and the equation lui ~ has one root also 
p = pi = {b-2 + y^2b{l + b))/{b^ + 6& - 4). Thus, for 
pi < p < Pr, these states are discriminated by a gener- 
alized measurement. Otherwise, they are discriminated 
by a projective measurement. In contrast to the paper 
[ll| . we give exact analytic bounds for the probability 
p distinguishing optimal generalized measurements from 
projective ones. 



C. N=4 

Let two states pi and p2 be given with the probability 
p and two other states and p4 with the probability 
1/2 — p, < p < 1/2. Let us choose the coordinate 
system such that the z = plane is the plane of the Bloch 
vectors (32 and they are symmetrically disposed with 
respect to x axis, i.e., 

Pi = (/3ia, /3i.2, 0) , p2 = (/3i4, -/3i,2, 0) , 

and assume that /3i^i > and /3i,2 > 0. Here, we will 
consider the case where the vectors P3 and /34 are placed 
into the x — plane and symmetrically disposed with 
respect to z axis, i.e., 

/33 = (0,/?3,2,/?3,3), P2 = (0,/33,2,-/93,3) 

with /33,2 > and /?3,3 > 0. 

From the system ([57]) . we find both the vector B ~ 
{Bi, 0, 0) and the optimal success probability Pcorr — A, 



Bi 



2p/3i,i(l-6p + 8p2) 
l-8p+16p2(l-/32 J 



if for discrimination of these states, one needs four detec- 
tion operators 

From here, we see that Pcorr does not depend on the 
states p3 and p^. In particular, these states may be or- 
thogonal to each other (/33^2 = 0) in which case they 
correspond to opposite points of the Bloch sphere. With 
these values of Si, B2, B^, and A, from the system of 
equations and P?|) . we find the frequencies 



Wi,2 = ±- 



1 + Ap[p~l + Ai3p~l)l3h] 



"/3i,2[l + 4p(-2+p[3 + 4/32 2])]2 
X [±4p(4p-l)/3i,2 + (-l + 4p[l-4p/32^] )/33,2] ,(79) 



CJ3 = CJ4 = 1 - -{cji 



(80) 



Note that for (3^ 2 = 0, when the states p^ and p4 are 
orthogonal to each other, uji = uj2- From here it follows 
that for a fixed value of (and hence Pi. 2), t<^4 is not 
positive, a;4 < 0, if 



P> 



1 - 2/3i.2 
8/32 



Pr 



and oji > if 



Pl,2 - 133,2 + J/3l,2(/3l,2+ 2/33,2 +4/3i,2/3|,2) 
P > — . n 



51,2 



2/33,2 



'1.2/^3,2 



whereas 0^2 > inside the interval pi < p < Pr- Note 
that for /3i,2,/?3,2 G [0, 1/2], the difference pr — pi is not 
negative, Pr — Pi > 0. Thus, for the p values from the 
interval p G (j)i,Pr), the states are discriminated with 
four detection operators %j — ujjTTj, j = 1, . . . , 4. For p > 
Pr, they are discriminated by a projective measurement 
which optimally distinguishes pi from p2 ■ For the success 
probability, one finds from Eq. ([75]) at p = Pr the value 
Pcorr = 1/2. Note that Eqs. ^ and ^ at pi = 
P2 — Pr yield the same value for the success probability. 
With p growing from pr up to 1/2, the success probability 
linearly increases until the value 1/2 -I- /3i,2. 

Kt p — pi, uji vanishes, wi — 0, and for p < p/, the 
states are optimally discriminated by the same measure- 
ment which optimally discriminates among the states p2, 
ps, and p4. From Eqs. (|7^ and ([50)1 at p = p/, we obtain 
corresponding frequencies 



W3 = LU4 



/3l,2 + /?3,2 



/3l,2 + 2/33,2 + 4/3i,2/3: 



a;2 = 2 — 2w3 



3,2 



(81) 

Note that for < /3i_2,/33,2 < 1/2 all the frequencies 
([5T|) are positive. On the other hand, for /?3.2 = 0, when 
the state ps is orthogonal to p4, pi = 1/4, ^3 — UJ4 — 
1, and LOi = UJ2 = 0. The success probability in this 
case is Pcorr — A — 1/2. Note also that the same value 
follows from Eqs. (HH) and ((55|) at pi = p2 = 1/4 and 
Pi • 02 = —1/4. When p ^ 0, the success probability 
tends linearly to zero. 



IV. CONCLUSION 

The known necessary and sufficient conditions for dis- 
criminating between nonorthogonal qubit states with the 
minimum error are formulated in terms of Bloch vectors 
representing the states. This permitted us to indicate 
an algorithmic solution to the direct optimization prob- 
lem and give a complete solution to the inverse optimiza- 
tion problem. By the direct optimization problem, we 
mean the problem of finding the optimal measurement 
strategy when the states to be discriminated are given 
together with their prior probabilities. Accordingly, the 
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inverse optimization problem is the problem of specify- 
ing all possible states and their prior probabilities which 
may be optimally discriminated by the given generalized 
measurement. 

An intermediate (or mixed) optimization problem may 
also be important for practical usage. We formulate it as 
follows. Assume that the minimum-error discrimination 
strategy for N given states is known. Is it possible to 
enlarge this set of states in such a way that they are opti- 
mally discriminated by the same measurement strategy? 
If yes, what are the conditions to which the additional 
states should satisfy? A simple analysis of the necessary 
and sufficient conditions for minimizing the generalized 
rate error tells us that the answer to this question is pos- 
itive. We formulated conditions in terms of Bloch vec- 
tors when a set of N qubit states is discriminated by the 
same orthogonal measurement which is optimal for dis- 
criminating among two states given with arbitrary prior 
probabilities. We applied this result to formulate condi- 
tions when three qubit states are optimally discriminated 
either by the generalized measurement or by the projec- 
tive one. In particular for N equiprobable states, we have 
shown that the optimal generalized measurement may 
contain M > 4 POVM elements only if the states them- 
selves may form a POVM. Otherwise, it contains either 
three or two POVM elements. For three equiprobable 
states, we found conditions which distinguish projective 



optimal measurements from generalized ones. 

To illustrate our approach, we considered an example 
of three states, two of which are equally likely. For a 
particular case of three mirror symmetric states, we re- 
produced a result previously published in |l3|. To illus- 
trate the advantage of the analytic solution compared to 
a numerical one, we considered another particular case 
previously reported in [Tlj . For this case in contrast to 
|ll| . we were able to present the strict analytic bounds 
for the prior probability for distinguishing the optimal 
generalized measurement from the projective one. As 
the final illustration of our approach , we solved the op- 
timization problem for four qubit states, two of which 
are given with the probability p and two others with the 
probability 1/2 — p. 
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